Zero-Shot Object Detection with Textual Descriptions
Authors
Abstract
Similar Resources
Zero-Shot Detection
As we move towards large-scale object detection, it is unrealistic to expect annotated training data for all object classes at sufficient scale, and so methods capable of unseen object detection are required. We propose a novel zero-shot method based on training an end-to-end model that fuses semantic attribute prediction with visual features to propose object bounding boxes for seen and unseen...
Textual Inference for Retrieving Labeled Object Descriptions
This thesis presents a knowledge-based solution for retrieving English descriptions for objects in a collection. Based on detailed analysis of the errors made by a baseline system relying on surface-level features (i.e. term frequency), we infer that an ideal solution to this problem should use deeper representations of the meaning encoded in textual descriptions. Applied Textual Inference (ATI...
Few-shot Object Detection
In this paper, we study object detection using a large pool of unlabeled images and only a few labeled images per category, named “few-shot object detection”. The key challenge consists in generating as many trustworthy training samples as possible from the pool. Using a few training examples as seeds, our method iterates between model training and high-confidence sample selection. In training, e...
Single-Shot Object Detection with Enriched Semantics
We propose a novel single shot object detection network named Detection with Enriched Semantics (DES). Our motivation is to enrich the semantics of object detection features within a typical deep detector, by a semantic segmentation branch and a global activation module. The segmentation branch is supervised by weak segmentation ground-truth, i.e., no extra annotation is required. In conjunctio...
Zero-shot Object Prediction using Semantic Scene Knowledge
This work focuses on the semantic relations between scenes and objects for visual object recognition. Semantic knowledge can be a powerful source of information, especially in scenarios with few or no annotated training samples. These scenarios are referred to as zero-shot or few-shot recognition and often build on visual attributes. Here, instead of relying on various visual attributes, a more d...
Journal
Journal title: Proceedings of the AAAI Conference on Artificial Intelligence
Year: 2019
ISSN: 2374-3468,2159-5399
DOI: 10.1609/aaai.v33i01.33018690